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A technique for background prediction using data, but maintaining a closed signal box is described. 
The result is extended to two background sources. Conditions on the applicability under correlated 
cuts are described. This technique is applied to both a toy model and an analysis of data from a 
rare neutral kaon decay experiment. 

PACS numbers: 06.20.Dk,14.40.Aq 



In this paper we describe a bifurcation analysis pro- 
cedure for data driven background prediction in a blind 
analysis of a closed signal region. In this type of anal- 
ysis we define a signal region, in some set of variables, 
where we keep the events hidden during the analysis. 
We wish to predict the number of background events un- 
der the full set of cuts in this signal region. We use the 
term cut to refer to a specific selection criteria. An event 
that passes a cut is selected and an event which fails a 
cut is thrown out. The procedure uses the application 
of inverse cuts to properly measure the veto power of 
the different sets of cuts while keeping the signal region 
closed. This technique was first developed for a single 
background source in the stopped K + experiments E787 
and E949 at Brookhaven []J . The work in this paper was 
inspired by the use of the bifurcation technique in the 
E391a experiment 

We begin with a derivation of the bifurcation analysis 
in the case of one background source and uncorrelated 
cuts. We then extend this to two background sources and 
a simple model of correlation between cuts. Throughout 
this paper the method will be applied to a toy model 
to predict background and then applied to an example 
from the E391a experiment. We utilize the Mathematica 
software package to implement the toy model 0]. 



ONE BACKGROUND CASE 

We begin discussing this method in the case of a sin- 
gle background source. Here a collection of setup cuts 
have been applied which eliminate all other sources of 
background. Our goal is to predict the amount of back- 
ground in the signal region when we apply the cuts A 
and B, which we refer to as the bifurcation cuts. The 
number of events we observe will be determined by the 
number of events before applying the cuts A and B (after 
applying the setup cuts) and the cut survival probability 
(CSP): 
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We consider events to lie in a multi-dimensional space 
with a dimension corresponding to every variable on 



which we can cut. Our set of cuts defines a multidimen- 
sional signal region which we wish to keep blind. If two 
cuts show no correlation in the events that they cut, this 
implies that these two cuts are orthogonal in this space. 
A diagram of this situation is shown in Fig. [TJ The CSP 
can then be decomposed into P(AB) = P(A)P(B) : 
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This can be expanded into 
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Here A and B are the inverses of cuts A and B, events 
which pass cut A fail cut A. Then we can calculate this 
from data based on the number of observed events in the 
signal region under the different cut conditions: 



N AB = N P(A)P(B), 
N AB = N P(B)P(A), 
N AB = N P(A)P(B). 
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Here N AB is the number of background events observed 
with the application of cut A and the inverse of cut B. 
N AB is the observed background events with the inverse 
of cut A and cut B applied. N AB is the count when 
the inverse of both A and B are applied. All of these 
values are outside the signal region defined in the multi- 
dimensional cut space allowing us to predict the back- 
ground without opening the box: 
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The procedure for producing a background prediction is 
as follows. First, apply setup cuts to the data, the num- 
ber of events in the signal box is iVo- The setup cuts are 
all the cuts other than A or B, which are applied. They 
should remove all other background sources. There is 
more freedom in choosing the setup cuts than cuts A 
and B, since they can be correlated with each other or 
A and B. We then apply cut A and B. By applying B, 
we are looking at events which are outside the signal box 
in the multidimensional space. We count the number of 
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events which pass these sets of cuts, N A §. We then do 
the same procedure in reverse applying the set of cuts B 
and the inverse of A to find N AB . Finally, we apply the 
inverse of both cuts A and B to find N AB ■ These values 
are combined to produce the background prediction. 
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early decreasing density with values between and 1 and 
an uniform distribution between and 1 for b. The dis- 
tribution of each variable for the two background types 
is shown in Table [U The distributions of the kinematic 
variables p and x are shown in Fig. [5] We define a sig- 
nal region by specifying the allowed kinematic variables: 
0.25 < x < 0.75 and 0.25 < p < 0.75. 
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TABLE I: Probability distribution functions for the 
variables of each event type in the Toy Model. 



FIG. 1: Schematic of background distribution in the cut 

space. 

Eqn [7] predicts no background events, if N AB or N AB 
are zero. This can be true for one of three reasons: 
N = 0, P(A) or P(B)=0, or P(A) or P(B) = 0. 
The first two possibilities are what we expect and re- 
flect cases where there should be no background events. 
The third possibility is more problematic. When P(A) 
or P(B) — 0, N AB should also be zero, but statistical 
fluctuations may prevent that from being true. This con- 
dition results from a poor choice of cuts where one cut 
eliminates almost no background events. If possible a 
different choice of cuts for A and B should be made. 

THE TOY MODEL 

Each event is described by four variables. Two kine- 
matic variables, p and x, which are used to describe the 
signal region and two cut variables, and a and b, which 
will be used to define the cuts. The cut variables, a and 
b, are independent of the kinematic variables, p and x. 
All of these variables range from to 1. 

We define 2 different types of events: Background 1 
and Background 2. They both have x variables with 
uniform distributions between and 1. Their p's have 
uniform distributions between and x for Background 1 
and between and 1 — x for Background 2. Background 
1 has an uniform distribution of a between and 1 and 
variable b has a linearly decreasing density with values 
between and 1. Background 2's a distribution is a lin- 
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FIG. 2: Kinematic variable distributions for 
Backgrounds 1 and 2. 

We define our cuts on variables a and b as: 

A = {a > 0.5), (8) 
B = (b> 0.5). (9) 

A and B are true or false statements. If they are false the 
event is cut. With the cut points defined, we can then 
calculate the CSP for each event type: P(A) or P(B). In 
this toy model the CSPs can be calculated analytically 
because we know the underlying distributions. These 
values are shown in Table [TT] 

One Background in the Toy Model 

In this section we discuss the case of a single signif- 
icant background. The background prediction is given 
by Eqn [7) We generated 1 x 10 4 Background 1 events 
over the whole range of kinematic variables. This leaves 
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Event Type 


P(A) 


P(B) 


Background 1 


0.5 


0.25 


Background 2 


0.25 


0.5 



TABLE II: Cut survival probabilities for each event 
type in the Toy Model. 



No 


2236 ± 47 


N AI3 


831 ± 29 


Nab 


280 ± 17 


Nab 


869 ± 29 


Predicted Background 


267.8 ± 20.6 


Observed Background 


256 ± 16 



TABLE III: Single background study for the Toy Model 
with only Background 1. 



m 2200 background events in the signal region before ap- 
plying cuts A and B. In Table Hill we show the observed 
number of events for each combination of cuts, the pre- 
dicted background, and the observed background after 
applying both cut A and B. The predicted background 
of 267.8 ± 20.6 agrees well with the 256 ± 16 observed 
background events. 



TWO BACKGROUND CASE 
Derivation 

The previous derivation applied in the case of a single 
background source. If the background is made up of two 
different background sources, Ao = N\ + A 2 , with differ- 
ent CSPs then there is a correlation introduced between 
cuts that must be accounted for. To do so, we begin by 
replacing Eqns [2] and 0][6] with: 
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N AB = N 1 P 1 (A)P 1 (B) 
N AB = N 1 P 1 (A)P 1 (B) 



N 2 P 2 (A)P 2 {B), 
N 2 P 2 (A)P 2 (B), 
N 2 P 2 (A)P 2 (B), 
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Then our previous calculation of the background has 
a cross term introduced. We wish to find the correction 
to the one background solution. We begin by substitut- 
ing the above definitions into the solution for the one 
background case, Eqn[7l 
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[N 1 P 1 {B)P 1 {A) + N 2 P 2 (B)P 2 (A)]. 



We expand the numerator: 

NabNab = N 2 l P 1 {A)Pi(A)P 1 {B)P 1 {B) 

+ AiA 2 [Pi(yl)P 2 (A)P 2 (P)Pi(P) 
+ P 2 (A)P 1 {A)P 1 (B)P 2 (B)] 
+ N 2 P 2 (A)P 2 (A)P 2 (B)P 2 (B). 
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We multiply Abk g by N^g to allow us to find the dif- 
ference of Eqn[T4land Abk g : 

N hH N AB = [ArP^P^P) + N 2 P 2 {A)P 2 {B)} x 
[NxP 1 {A)P l {B) + N 2 P 2 {A)P 2 (B)} 
= A 1 2 P 1 (A)P 1 (A)P 1 (P)P 1 (P) 

+ NiN 2 [Pi (A)P 2 (A)P 1 (B)P 1 (B) 
+ P 2 {A)P 1 (A)P 2 (B)P 1 (B)] 
+ NlP 2 (A)P 2 (A)P 2 (B)P 2 (B), 



(16) 
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The cross term vanishes if Pi (A) — P 2 (A) and Pi (B) = 
P 2 (B), where for the purposes of the cuts the two back- 
grounds are the same. 

We can simplify the cross term by rewriting the CSPs 
of the inverse cuts in terms of the CSPs of the cuts, 
Pi{A) = 1 — Pi(A). Each element of the cross term has 
the same structure which can be expanded to: 

P i (A)P j (A)P k (B)P e {B) = Pi(A)P k {B)(l - P 3 {A))x 

(l-Pe(B)) 
= P t {A)P k (B) 

- P(A)P 3 (A)P k (B) 

- P i (A)Pj t (B)P e (B) 

+ P l (A)P J (A)P k (B)P i (B). 

(18) 

Summing the elements of the cross term cancels every- 
thing except the terms with two CSPs: 
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We can further simplify the cross term by defining 
A A = P 2 (A) - Pi (A) and A B = P 2 (B) - P(P). 
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The second term in Eqn [501 is not the contribution 
of a particular source to the background prediction. It 
is a correction to the prediction of the total number of 
background events from both sources. 



We now consider the case of Ab = 0. Here the correc- 
tion term is zero, but the contribution from the second 
background is not. We can see that the first term in Eqn 
[201 correctly predicts the background by simplifying Eqns 
liHTgl substituting P(B) for Pi(B) and P 2 {B): 

N hkg = (N x Pi(A) + N 2 P 2 (A))P(B), (27) 

Nab = (NiPi(A) + N 2 P 2 (A))P(B), (28) 

Nab = (NiPi(A) + N 2 P 2 (A))P(B), (29) 

N AB = {NxPi(A) + N 2 P 2 {A))P{B). (30) 



Properties of the Two Background Solution 

We now discuss the behavior of the Eqn [20] This so- 
lution has the reasonable property that it is symmetric 
with respect to the definitions of the cuts A and B and 
the backgrounds 1 and 2. 

We consider the behavior of Eqn(2D]under extreme con- 
ditions. First, we show that under no conditions can the 
total JVbkg be negative. The correction term will have it's 
maximum negative value when Aa = 1 and As = —1 or 
A a = — 1 and Ab = 1. Under these conditions, Nab = 
and iVbkg is undefined. We therefore want to study TVbkg's 
behavior as we approach this limit. We begin by setting 
Ab = — 1 and studying the limit as A^ — * 1 . In this case 
cut B removes Background 1 completely, but the Back- 
ground 1 events which survive AB still contribute to the 
prediction produced by Eqn [71 

The condition that = —1 sets what values the 
CSPs of the B can take: 



P 1 (B) =P 2 {B) = 0, 
P 2 {B) = P X {B) = 1. 



(21) 
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Substituting these values into Eqns fTTirTBI we find: 

Nab — N 2 P 2 (A), (23) 



N AB =N 1 P 1 (A) =N 1 (1-P 1 (A)), 



(24) 



N AS = N 2 P 2 (A) = N 2 (l - P 2 (A)). (25) 

We then substitute these values into Eqn [20] and sum 
the two terms: 
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As A A — > 1, Pi (A) -» 0. Therefore the iV b k g goes to 
0. This indicates that iVbkg never has a non-physical 
negative value. 



Substituting these values into the first term of [2D] and 
simplifying gives: 
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(NiPi(A) + N 2 P 2 (A))P(B). (31) 



This is iVbkg, so the correction is not necessary to prop- 
erly predict the background when one of the A's is zero. 



Interpretation of the Two Background Solution 

It may seem counterintuitive that two backgrounds 
cannot be combined simply. This can best be under- 
stood as the second background introducing an implicit 
correlation. 

As an example, consider two backgrounds which in- 
dividually have no correlation between cut A and cut 
B, but do have different cut survival probabilities. If 
Pi (A) = 0.75 and Pi(P) = 0.75, while P 2 (A) = 0.25 and 
P 2 (B) — 0.25 then the resulting combination of the two 
backgrounds will have a correlation. Events which sur- 
vive cut A are have a greater chance to survive cut B, 
because events which survive cut A are more likely to be 
part of Background 1. Events that don't survive cut A 
are less likely to survive cut B, because they are more 
likely to be part of background 2. Therefore there is a 
correlation between cuts A and P, even though for the 
individual backgrounds they are uncorrelated. 

The values of N\, N 2 , Aa, and Ab are not directly 
accessible in data without opening the signal box. There 
are two options: either derive these values from Monte 
Carlo or from other regions in signal space. Determining 
iVi and N 2 generally will require both an alternative way 
of predicting one of the backgrounds and the value of -/V , 
the total number of background after setup cuts. This 
raises the question whether determining Nq biases the 
analysis. From No and the other observed background 
numbers, N A b, Nabi an d Na§, it is possible to effec- 
tively open the box and count iVbkg- Determining A^ 
and A B also requires additional input. Their values can 
be derived from either Monte Carlo or data outside the 
signal region. 
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Two Background Toy Model 

In our toy model, we can study the effect of multiple 
background sources by varying the relative strength of 
a second background. We begin by calculating with the 
false assumption that there is a single background mode. 
We vary the relative admixture of Background 2. The 
total number of events, iVi + N%, was held constant at 
2 x 10 4 . The discrepancy between the prediction and 
the observed background increases as the the number of 
background events from the second source increases, as 
shown in Fig. [3] 

We now apply the correction term to the background 
prediction (Eqn [20]) . In the case of this toy model, we 
know the values of Ni and Ns because we have set them. 
In a real analysis, it would be necessary to determine 
these values through either Monte Carlo studies or stud- 
ies of different signal regions which are then extrapolated 
into the signal box. The differences in the cut probabili- 
ties, and A Bl also need to be determined from out- 
side sources. In this model = —25% and A B = 25%. 
Since the probability differences are of opposite signs the 
correction is negative and reduces the predicted back- 
ground. 

In Fig. [3l we show the results of keeping the total num- 
ber of background events the same while increasing the 
fraction of Background 2 events. Here only the predicted 
background without correction increases, while the ob- 
served and corrected backgrounds remain relatively flat. 
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FIG. 3: Predicted and observed background for 
different admixtures of a Background 2. Squares are 
predicted background (without second background 
correction), triangles arc the observed background in 
data, diamonds are the corrected prediction. The x-axis 
is the number of generated Background 2 events, the 
total number of events, N% + JVa, was held constant at 
2 x 10 4 . 



CUT CORRELATION 

In the derivation of both the one and two background 
cases, we have assumed that the cuts A and B are un- 



corrected. Of course, in real applications, it is unlikely 
to find two cuts which are perfectly uncorrelated. We 
would therefore like to find some general figure of merit 
to determine how correlation introduces errors into the 
background prediction. The following discussion will as- 
sume only one background source. 



Impact of Cut Correlation 

To derive a correction to the background prediction, 
we use a simple model of cut correlation. We describe a 
case where the cuts have a weak linear correlation. In this 
model, the posterior probability for each cut is different 
than the prior probability. 
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FIG. 4: Cut space with correlated cuts. 

We begin by specifying the background values in terms 
of the CSPs, which are now posterior probabilities, spec- 
ifying the dependance on both cut conditions: 



AW = N P{A\B)P(B) = N P(A)P(B\A), 
N AB = N P{A\B)P(B) = N P(A)P(B\A), 
N AB = N P(A\B)P(B) = N P(A)P(B\A), 
N P(A\B)P(B) = N Q P(A)P(B\A). 
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AB 



(32) 
(33) 
(34) 
(35) 



We proceed in the same fashion as for the two back- 
ground case and substitute these definitions into the so- 
lution (Eqn 7) for the single background, uncorrelated 
case: 



N AB N AB N*P{A\B)P{B)P{A\B)P{B) 



Nab 
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N P(A\B)P(B) 
We assume the correlations are small and relate the 
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different posterior probabilities to each other: 

P{A\B) = P(A\B) - e, (37) 

P(A\B) = P(A\B) + e, (38) 

P(B\A) = P(B\A) - S, (39) 

P(B\A) = P(B\A) + 5. (40) 

The corrections e and S should be small. What we mean 
by small will be defined at the end of the derivation by 
what values are necessary for the corrections which are 
first order in e and S to be negligible. We have the free- 
dom to choose to formulate the correction as a function of 
e or S. We choose to use e. We substitute the definitions 
into Eqn 31: 



P{A\B) _ jjg ^ 7V pred 
P{A\B) Em. ~ N AB ■ 



(47) 



These approximations give us the first order correction 
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N AB P(A\B)+e 

= N (P{A\B)P{B) -eP{B)) 
1 



(41) 
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Returning to our toy model, we introduce a correlation 
between the a and b variables in Background 1. We add 
a term linearly dependent on b to a, and then scale a 
to keep it between and 1 and to reduce the change in 
background due to the change in the average value of a : 

f(a) - (1 + e'),a E [e'b/(l + e'), (1 + e'b)/(l + e)]. (49) 

The variable e' is the knob we use to tune the correlation. 
It is closely related to the variable e that is defined in 
Eqns l38l and l39l as is shown in Fig. [5l 
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Assuming the e term in the denominator is small, we 
expand this result in e: 
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Multiplying this out and keeping the second order terms 
of e gives: 



FIG. 5: The value of e (Eqn [38]) as a function of e' (Eqn 
in the Toy Model. 
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P(A\B)P{B) . We show the predicted and observed background in 
' Fig. [6l As e' increases the background in data increases, 
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The first term with no e factors is N^g. The condi- 
tion for the correlations to have a negligible impact on 
our background prediction is that the e terms be much 
smaller than A ■ A 1 



-ZVbke 
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P(A\B)' 



because the correlation increases the average value a, 
while the predicted background decreases. 

In Fig. [6l we show the prediction with C e added. It 
improves the agreement for a fairly wide range of e. 



EXAMPLE FROM E391A 

For a more realistic example, we discuss the use of 
this technique in the first search for the rare kaon decay 
Kl — » tt tt q i'D at E391a. E391a is dedicated experiment 
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(45) for the search for the rare kaon decay Kl 



■ located 



P(A\B) y P{A\B)- 



These terms require opening the signal box to know the 
correct values of the CSPs. We can however approximate 
these values with less knowledge, under the assumption 
that the number of events in the signal box is small: 



P(B) 



N AB + N AB _ N AB 



No 



No 1 



(46) 



at KEK. The experimental apparatus consists of a 47r 
hermetic photon veto system and an array of Csl, an 
inorganic crystal scintillator, for signal detection. The 
Kl — * 7r 7r ^j/ decay has a Standard Model branching 
ratio of (1.4 ± 0.4) x 10~ 13 [4]. The bifurcation method 
is to predict the background to this mode from data. 

The final state of Kl — > tt tt°vD is four photons from 
the 7r°'s and two unobservable i^'s. Events are recon- 
structed by pairing the photons and calculating the tt° 
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FIG. 6: Predicted and observed background for 
different levels of correlation between a and b in the Toy 
Model. Triangles are data background, squares are 
predicted, and diamonds are predicted plus second 
order e correction fEqnl45|). The x-axis is e. 
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FIG. 7: Predicted and observed background for 
different levels of correlation between a and b in the Toy 

Model. Triangles are data background, squares are 
predicted, and diamonds are predicted plus practical e 
correction (EanHS]). The x-axis is e. 



the fiducial region. The Kl — > n tt vv signal box is de- 
fined as lOOMeV/c < P T < 200MeV/c, 268MeV/c 2 < 
M„o_„o < 400MeV/c 2 , 300cm < Z < 500cm. The distri- 
bution of events in the Pt -mass plane is shown in Fig. 

We divided our cuts into three groupings: the setup 
cuts, cut A which contained most of the photon ve- 
toes, and cut B which contained primarily cuts on pho- 
ton reconstruction quality in the Csl array. Using Eqn 
[7J we first tested the method on regions surrounding 
the signal box. We define a Low Pt region with the 
same bounds on M^o.^o and Z as the signal region and 
P T < lOOMeV/c. The High Mass Region is defined with 
the same bounds on Pt and Z as the signal region and 
400MeV/c 2 < M OT a. OT o < 500MeV/c 2 . The results of this 
are shown in Table IIVI There is a significant discrep- 
ancy in the Low Pt region, this is due to neglecting the 
correlation. 

Pt vs Mass 
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FIG. 8: Pt plotted against mass with all cuts in the 
E391a experiment. Z between 300 cm and 500 cm. The 
rectangular regions correspond to the regions in Table 

Eli- 



decay vertices. The pairing with the smallest separa- 
tion between tt° vertices is selected. The signal region 
is defined in three kincmatical variables: the transverse 
momentum of the 7r°-7r° system, the invariant mass of 
the the 7r°-7r° system, M n o_ n o , and reconstructed decay 
vertex, Z. 

The signal region is selected to avoid the two primary 
background sources: Kl — ► 7r 7r°7r decays and neutron 
interactions with a membrane located downstream of the 
fiducial region. The Kl — > 7r 7r°7r° events have a mass 
peak below the Kl mass and with relatively low P^'s 
of under lOOMeV/c. Their decay vertices cover the full 
range of the fiducial decay region. The neutron related 
events are well separated in the decay vertex having a 
peak at the membrane position, 50 cm downstream of 



TABLE IV: Prediction of background events in 
different regions without correlation or secondary 
background corrections. 



Region 


Nab 


N AB 




Prediction 


Data 


Low Pt 


393 


72 


115 


21.1 ± 3.3 


13 


High Mass 


46 


9 


4 


0.78 ± 0.48 


1 


Signal 


84 


18 


2 


0.43 ±0.32 


1 



We estimated the cut correlation on the Kl — » 7r 7r 7r° 
background in its peak region using both data and Monte 
Carlo. The value of e was found to be (—.049 ± .035). 
We first apply Eqn QS] on the Low Pt region and find 
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a correction of —6.67 ± 4.81. This correction brings the 
prediction in Table ITVl into much better agreement with 
data. We next apply Eqn|35]to the signal region and find 
a correction of (—0.12 ± 0.12) events. 

Low energy neutron interactions are difficult to sim- 
ulate in general and we had the additional difficulty of 
not knowing the precise shape of the membrane in the 
beamline. Therefore, we used data to estimate the level 
of neutron background. The value of N2 was estimated 
by fitting the distribution events in the High Pt -High Z 
region, Z > 500cm and Pt > lOOMeV/c. Events in this 
region are predominately core neutron events. We ap- 
plied a set of loose cuts to ensure high statistics. We 
subtracted off the contribution of the Kl — > tt ^ ^ 
background using a Monte Carlo prediction of their con- 
tribution under the loose set of cuts. The fitted core 
neutron distribution was extended into the signal region 
and the total number core neutron events in the signal 
region was predicted from the distribution. Then the pre- 
dicted number of events in the signal region were scaled 
down using the factor by which the population of events 
in the High Pt -High Z region were reduced. Using this 
procedure we estimate N2 — 2.16 ± 0.03 s t a t. ± 1.05 sys t. . 
The large systematic error comes from the subtraction of 
the Kl — > 7r°7r 7r° contribution in the High Pt -High Z 
region. The differences between the cut survival proba- 
bilities for core neutron events and Kl — * 7r 7r°7r° events 
were found by comparing the difference in cut effective- 
ness between the Kl — > 7r 7r°7r° peak in the Low Mass- 
Low Pt region and the core neutron peak in the High Z 
region. The values which go into the correction and the 
computed correction are shown in Table [Vj 



A A 


A B 


Ni 


N 2 


Correction 


26.4% 


8.6% 


(101.9± 
10.2 ± 1.05) 


(2.1± 
0.03 ± 1.05) 


(0.06 ±0.05) 



TABLE V: Values for the core neutron multiple 
background correction. 



mates of the systematic error made by neglecting the 
correlation and multiple background corrections. This 
results in a background prediction of 0.43 ± 0.32 sta t. ± 
0.13 sys t.. Opening the signal box, we observed a single 
event, consistent with our background prediction. 
DISCUSSION 

The bifurcation analysis technique allows us to produce 
data driven background predictions while still maintain- 
ing a blind analysis. In this paper we have shown how to 
extend the bifurcation analysis to the case of two back- 
ground sources and correlated cuts. 

The correction for a second background source is ef- 
fective for any level of secondary background. It does 
require information beyond which is available directly 
from a blind analysis. It requires a combination of Monte 
Carlo information about the relative strengths of the two 
backgrounds and how the cut survival probabilities vary 
between the two backgrounds. It also requires knowledge 
of Nq, the number of events in the signal box after the 
setup cuts. 

Correlations between the two cuts cannot be handled 
as easily. Even when the correlation between cuts is lin- 
ear, the effects on the background prediction are non- 
linear. Therefore, special care must be taken when se- 
lecting the cuts for the bifurcation analysis to avoid cor- 
relation. 

One particular aspect of the impact of the prediction 
on the cut correlation is it's dependance on the value of 
Nq. This leads to two competing forces in optimizing the 
division of cuts into the setup cuts and the bifurcation 
cuts. From the perspective of minimizing statistical error 
in the bifurcation prediction, one would like a large value 
for No with powerful cuts for cut A and B, so that the 
statistical errors on the terms of Eqn[7|are small. On the 
other hand, a large Nq means the prediction is sensitive 
to small correlations between the cuts. 



The Kl — ► 7r 7r°7r° decay has three ways in which it 
can cause backgrounds depending on the number of pho- 
tons lost through veto inefficiency or fusion of photon 
clusters in the Csl. These three channels are not sep- 
arated in the signal space, so our estimate of cut cor- 
relation includes contribution from these multiple back- 
ground types. 

The statistical uncertainties on both the correlation 
and the core neutron correction are large relative to their 
size. Therefore we decided to use these values as esti- 
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